Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 41188 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.9 MiB |
| Average record size in memory | 176.0 B |
Variable types
| CAT | 11 |
|---|---|
| NUM | 11 |
euribor3m is highly correlated with emp.var.rate and 1 other fields | High correlation |
emp.var.rate is highly correlated with euribor3m and 1 other fields | High correlation |
nr.employed is highly correlated with emp.var.rate and 1 other fields | High correlation |
Unnamed: 0 has unique values | Unique |
previous has 35563 (86.3%) zeros | Zeros |
Reproduction
| Analysis started | 2020-11-14 18:01:40.212978 |
|---|---|
| Analysis finished | 2020-11-14 18:02:19.663952 |
| Duration | 39.45 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 41188 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20593.5 |
|---|---|
| Minimum | 0 |
| Maximum | 41187 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2059.35 |
| Q1 | 10296.75 |
| median | 20593.5 |
| Q3 | 30890.25 |
| 95-th percentile | 39127.65 |
| Maximum | 41187 |
| Range | 41187 |
| Interquartile range (IQR) | 20593.5 |
Descriptive statistics
| Standard deviation | 11890.09578 |
|---|---|
| Coefficient of variation (CV) | 0.5773712958 |
| Kurtosis | -1.2 |
| Mean | 20593.5 |
| Median Absolute Deviation (MAD) | 10297 |
| Skewness | 0 |
| Sum | 848205078 |
| Variance | 141374377.7 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 34042 | 1 | < 0.1% | |
| 38232 | 1 | < 0.1% | |
| 11599 | 1 | < 0.1% | |
| 9550 | 1 | < 0.1% | |
| 15693 | 1 | < 0.1% | |
| 13644 | 1 | < 0.1% | |
| 3403 | 1 | < 0.1% | |
| 1354 | 1 | < 0.1% | |
| 7497 | 1 | < 0.1% | |
| Other values (41178) | 41178 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 41187 | 1 | < 0.1% | |
| 41186 | 1 | < 0.1% | |
| 41185 | 1 | < 0.1% | |
| 41184 | 1 | < 0.1% | |
| 41183 | 1 | < 0.1% |
age
Real number (ℝ≥0)
| Distinct | 78 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.02406041 |
|---|---|
| Minimum | 17 |
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 47 |
| 95-th percentile | 58 |
| Maximum | 98 |
| Range | 81 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.42124998 |
|---|---|
| Coefficient of variation (CV) | 0.2603746315 |
| Kurtosis | 0.7913115312 |
| Mean | 40.02406041 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7846968158 |
| Sum | 1648511 |
| Variance | 108.6024512 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 31 | 1947 | 4.7% | |
| 32 | 1846 | 4.5% | |
| 33 | 1833 | 4.5% | |
| 36 | 1780 | 4.3% | |
| 35 | 1759 | 4.3% | |
| 34 | 1745 | 4.2% | |
| 30 | 1714 | 4.2% | |
| 37 | 1475 | 3.6% | |
| 29 | 1453 | 3.5% | |
| 39 | 1432 | 3.5% | |
| Other values (68) | 24204 | 58.8% |
| Value | Count | Frequency (%) | |
| 17 | 5 | < 0.1% | |
| 18 | 28 | 0.1% | |
| 19 | 42 | 0.1% | |
| 20 | 65 | 0.2% | |
| 21 | 102 | 0.2% |
| Value | Count | Frequency (%) | |
| 98 | 2 | < 0.1% | |
| 95 | 1 | < 0.1% | |
| 94 | 1 | < 0.1% | |
| 92 | 4 | < 0.1% | |
| 91 | 2 | < 0.1% |
job
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "admin." | |
|---|---|
| "blue-collar" | |
| "technician" | |
| "services" | |
| "management" | |
| Other values (7) |
| Value | Count | Frequency (%) | |
| "admin." | 10422 | 25.3% | |
| "blue-collar" | 9254 | 22.5% | |
| "technician" | 6743 | 16.4% | |
| "services" | 3969 | 9.6% | |
| "management" | 2924 | 7.1% | |
| "retired" | 1720 | 4.2% | |
| "entrepreneur" | 1456 | 3.5% | |
| "self-employed" | 1421 | 3.5% | |
| "housemaid" | 1060 | 2.6% | |
| "unemployed" | 1014 | 2.5% | |
| Other values (2) | 1205 | 2.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 10.95522968 |
| Min length | 8 |
marital
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "married" | |
|---|---|
| "single" | |
| "divorced" | |
| "unknown" | 80 |
| Value | Count | Frequency (%) | |
| "married" | 24928 | 60.5% | |
| "single" | 11568 | 28.1% | |
| "divorced" | 4612 | 11.2% | |
| "unknown" | 80 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.831115859 |
| Min length | 8 |
education
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "university.degree" | |
|---|---|
| "high.school" | |
| "basic.9y" | |
| "professional.course" | |
| "basic.4y" | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| "university.degree" | 12168 | 29.5% | |
| "high.school" | 9515 | 23.1% | |
| "basic.9y" | 6045 | 14.7% | |
| "professional.course" | 5243 | 12.7% | |
| "basic.4y" | 4176 | 10.1% | |
| "basic.6y" | 2292 | 5.6% | |
| "unknown" | 1731 | 4.2% | |
| "illiterate" | 18 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 14.7109595 |
| Min length | 9 |
default
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "no" | |
|---|---|
| "unknown" | |
| "yes" | 3 |
| Value | Count | Frequency (%) | |
| "no" | 32588 | 79.1% | |
| "unknown" | 8597 | 20.9% | |
| "yes" | 3 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 5.043702049 |
| Min length | 4 |
housing
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "yes" | |
|---|---|
| "no" | |
| "unknown" | 990 |
| Value | Count | Frequency (%) | |
| "yes" | 21576 | 52.4% | |
| "no" | 18622 | 45.2% | |
| "unknown" | 990 | 2.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 4.644022531 |
| Min length | 4 |
loan
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "no" | |
|---|---|
| "yes" | |
| "unknown" | 990 |
| Value | Count | Frequency (%) | |
| "no" | 33950 | 82.4% | |
| "yes" | 6248 | 15.2% | |
| "unknown" | 990 | 2.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 4.271875303 |
| Min length | 4 |
contact
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "cellular" | |
|---|---|
| "telephone" |
| Value | Count | Frequency (%) | |
| "cellular" | 26144 | 63.5% | |
| "telephone" | 15044 | 36.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.36525202 |
| Min length | 10 |
month
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "may" | |
|---|---|
| "jul" | |
| "aug" | |
| "jun" | |
| "nov" | |
| Other values (5) |
| Value | Count | Frequency (%) | |
| "may" | 13769 | 33.4% | |
| "jul" | 7174 | 17.4% | |
| "aug" | 6178 | 15.0% | |
| "jun" | 5318 | 12.9% | |
| "nov" | 4101 | 10.0% | |
| "apr" | 2632 | 6.4% | |
| "oct" | 718 | 1.7% | |
| "sep" | 570 | 1.4% | |
| "mar" | 546 | 1.3% | |
| "dec" | 182 | 0.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
day_of_week
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "thu" | |
|---|---|
| "mon" | |
| "wed" | |
| "tue" | |
| "fri" |
| Value | Count | Frequency (%) | |
| "thu" | 8623 | 20.9% | |
| "mon" | 8514 | 20.7% | |
| "wed" | 8134 | 19.7% | |
| "tue" | 8090 | 19.6% | |
| "fri" | 7827 | 19.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
duration
Real number (ℝ≥0)
| Distinct | 1544 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 258.2850102 |
|---|---|
| Minimum | 0 |
| Maximum | 4918 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 102 |
| median | 180 |
| Q3 | 319 |
| 95-th percentile | 752.65 |
| Maximum | 4918 |
| Range | 4918 |
| Interquartile range (IQR) | 217 |
Descriptive statistics
| Standard deviation | 259.2792488 |
|---|---|
| Coefficient of variation (CV) | 1.003849386 |
| Kurtosis | 20.24793801 |
| Mean | 258.2850102 |
| Median Absolute Deviation (MAD) | 94 |
| Skewness | 3.263141255 |
| Sum | 10638243 |
| Variance | 67225.72888 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 85 | 170 | 0.4% | |
| 90 | 170 | 0.4% | |
| 136 | 168 | 0.4% | |
| 73 | 167 | 0.4% | |
| 124 | 164 | 0.4% | |
| 87 | 162 | 0.4% | |
| 72 | 161 | 0.4% | |
| 104 | 161 | 0.4% | |
| 111 | 160 | 0.4% | |
| 106 | 159 | 0.4% | |
| Other values (1534) | 39546 | 96.0% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 1 | 3 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 3 | < 0.1% | |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4918 | 1 | < 0.1% | |
| 4199 | 1 | < 0.1% | |
| 3785 | 1 | < 0.1% | |
| 3643 | 1 | < 0.1% | |
| 3631 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.567592503 |
|---|---|
| Minimum | 1 |
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 56 |
| Range | 55 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.770013543 |
|---|---|
| Coefficient of variation (CV) | 1.078836903 |
| Kurtosis | 36.97979514 |
| Mean | 2.567592503 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.762506697 |
| Sum | 105754 |
| Variance | 7.672975028 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 17642 | 42.8% | |
| 2 | 10570 | 25.7% | |
| 3 | 5341 | 13.0% | |
| 4 | 2651 | 6.4% | |
| 5 | 1599 | 3.9% | |
| 6 | 979 | 2.4% | |
| 7 | 629 | 1.5% | |
| 8 | 400 | 1.0% | |
| 9 | 283 | 0.7% | |
| 10 | 225 | 0.5% | |
| Other values (32) | 869 | 2.1% |
| Value | Count | Frequency (%) | |
| 1 | 17642 | 42.8% | |
| 2 | 10570 | 25.7% | |
| 3 | 5341 | 13.0% | |
| 4 | 2651 | 6.4% | |
| 5 | 1599 | 3.9% |
| Value | Count | Frequency (%) | |
| 56 | 1 | < 0.1% | |
| 43 | 2 | < 0.1% | |
| 42 | 2 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 40 | 2 | < 0.1% |
pdays
Real number (ℝ≥0)
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 962.475454 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 15 |
| Zeros (%) | < 0.1% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 999 |
| median | 999 |
| Q3 | 999 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 186.9109073 |
|---|---|
| Coefficient of variation (CV) | 0.194198103 |
| Kurtosis | 22.22946263 |
| Mean | 962.475454 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.922189916 |
| Sum | 39642439 |
| Variance | 34935.68728 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 999 | 39673 | 96.3% | |
| 3 | 439 | 1.1% | |
| 6 | 412 | 1.0% | |
| 4 | 118 | 0.3% | |
| 9 | 64 | 0.2% | |
| 2 | 61 | 0.1% | |
| 7 | 60 | 0.1% | |
| 12 | 58 | 0.1% | |
| 10 | 52 | 0.1% | |
| 5 | 46 | 0.1% | |
| Other values (17) | 205 | 0.5% |
| Value | Count | Frequency (%) | |
| 0 | 15 | < 0.1% | |
| 1 | 26 | 0.1% | |
| 2 | 61 | 0.1% | |
| 3 | 439 | 1.1% | |
| 4 | 118 | 0.3% |
| Value | Count | Frequency (%) | |
| 999 | 39673 | 96.3% | |
| 27 | 1 | < 0.1% | |
| 26 | 1 | < 0.1% | |
| 25 | 1 | < 0.1% | |
| 22 | 3 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1729629989 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 35563 |
| Zeros (%) | 86.3% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4949010798 |
|---|---|
| Coefficient of variation (CV) | 2.861311858 |
| Kurtosis | 20.10881622 |
| Mean | 0.1729629989 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.832042243 |
| Sum | 7124 |
| Variance | 0.2449270788 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 35563 | 86.3% | |
| 1 | 4561 | 11.1% | |
| 2 | 754 | 1.8% | |
| 3 | 216 | 0.5% | |
| 4 | 70 | 0.2% | |
| 5 | 18 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 35563 | 86.3% | |
| 1 | 4561 | 11.1% | |
| 2 | 754 | 1.8% | |
| 3 | 216 | 0.5% | |
| 4 | 70 | 0.2% |
| Value | Count | Frequency (%) | |
| 7 | 1 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 5 | 18 | < 0.1% | |
| 4 | 70 | 0.2% | |
| 3 | 216 | 0.5% |
poutcome
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "nonexistent" | |
|---|---|
| "failure" | |
| "success" | 1373 |
| Value | Count | Frequency (%) | |
| "nonexistent" | 35563 | 86.3% | |
| "failure" | 4252 | 10.3% | |
| "success" | 1373 | 3.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.45372439 |
| Min length | 9 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08188550063 |
|---|---|
| Minimum | -3.4 |
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -3.4 |
|---|---|
| 5-th percentile | -2.9 |
| Q1 | -1.8 |
| median | 1.1 |
| Q3 | 1.4 |
| 95-th percentile | 1.4 |
| Maximum | 1.4 |
| Range | 4.8 |
| Interquartile range (IQR) | 3.2 |
Descriptive statistics
| Standard deviation | 1.570959741 |
|---|---|
| Coefficient of variation (CV) | 19.18483405 |
| Kurtosis | -1.062631525 |
| Mean | 0.08188550063 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.7240955492 |
| Sum | 3372.7 |
| Variance | 2.467914506 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.4 | 16234 | 39.4% | |
| -1.8 | 9184 | 22.3% | |
| 1.1 | 7763 | 18.8% | |
| -0.1 | 3683 | 8.9% | |
| -2.9 | 1663 | 4.0% | |
| -3.4 | 1071 | 2.6% | |
| -1.7 | 773 | 1.9% | |
| -1.1 | 635 | 1.5% | |
| -3 | 172 | 0.4% | |
| -0.2 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| -3.4 | 1071 | 2.6% | |
| -3 | 172 | 0.4% | |
| -2.9 | 1663 | 4.0% | |
| -1.8 | 9184 | 22.3% | |
| -1.7 | 773 | 1.9% |
| Value | Count | Frequency (%) | |
| 1.4 | 16234 | 39.4% | |
| 1.1 | 7763 | 18.8% | |
| -0.1 | 3683 | 8.9% | |
| -0.2 | 10 | < 0.1% | |
| -1.1 | 635 | 1.5% |
cons.price.idx
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.57566437 |
|---|---|
| Minimum | 92.201 |
| Maximum | 94.767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.713 |
| Q1 | 93.075 |
| median | 93.749 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.578840049 |
|---|---|
| Coefficient of variation (CV) | 0.00618579684 |
| Kurtosis | -0.8298085772 |
| Mean | 93.57566437 |
| Median Absolute Deviation (MAD) | 0.38 |
| Skewness | -0.2308876514 |
| Sum | 3854194.464 |
| Variance | 0.3350558023 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 93.994 | 7763 | 18.8% | |
| 93.918 | 6685 | 16.2% | |
| 92.893 | 5794 | 14.1% | |
| 93.444 | 5175 | 12.6% | |
| 94.465 | 4374 | 10.6% | |
| 93.2 | 3616 | 8.8% | |
| 93.075 | 2458 | 6.0% | |
| 92.201 | 770 | 1.9% | |
| 92.963 | 715 | 1.7% | |
| 92.431 | 447 | 1.1% | |
| Other values (16) | 3391 | 8.2% |
| Value | Count | Frequency (%) | |
| 92.201 | 770 | 1.9% | |
| 92.379 | 267 | 0.6% | |
| 92.431 | 447 | 1.1% | |
| 92.469 | 178 | 0.4% | |
| 92.649 | 357 | 0.9% |
| Value | Count | Frequency (%) | |
| 94.767 | 128 | 0.3% | |
| 94.601 | 204 | 0.5% | |
| 94.465 | 4374 | 10.6% | |
| 94.215 | 311 | 0.8% | |
| 94.199 | 303 | 0.7% |
cons.conf.idx
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.50260027 |
|---|---|
| Minimum | -50.8 |
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -41.8 |
| Q3 | -36.4 |
| 95-th percentile | -33.6 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.628197856 |
|---|---|
| Coefficient of variation (CV) | -0.1142691537 |
| Kurtosis | -0.3585583105 |
| Mean | -40.50260027 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.3031798587 |
| Sum | -1668221.1 |
| Variance | 21.4202154 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| -36.4 | 7763 | 18.8% | |
| -42.7 | 6685 | 16.2% | |
| -46.2 | 5794 | 14.1% | |
| -36.1 | 5175 | 12.6% | |
| -41.8 | 4374 | 10.6% | |
| -42 | 3616 | 8.8% | |
| -47.1 | 2458 | 6.0% | |
| -31.4 | 770 | 1.9% | |
| -40.8 | 715 | 1.7% | |
| -26.9 | 447 | 1.1% | |
| Other values (16) | 3391 | 8.2% |
| Value | Count | Frequency (%) | |
| -50.8 | 128 | 0.3% | |
| -50 | 282 | 0.7% | |
| -49.5 | 204 | 0.5% | |
| -47.1 | 2458 | 6.0% | |
| -46.2 | 5794 | 14.1% |
| Value | Count | Frequency (%) | |
| -26.9 | 447 | 1.1% | |
| -29.8 | 267 | 0.6% | |
| -30.1 | 357 | 0.9% | |
| -31.4 | 770 | 1.9% | |
| -33 | 172 | 0.4% |
| Distinct | 316 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.621290813 |
|---|---|
| Minimum | 0.634 |
| Maximum | 5.045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0.634 |
|---|---|
| 5-th percentile | 0.797 |
| Q1 | 1.344 |
| median | 4.857 |
| Q3 | 4.961 |
| 95-th percentile | 4.966 |
| Maximum | 5.045 |
| Range | 4.411 |
| Interquartile range (IQR) | 3.617 |
Descriptive statistics
| Standard deviation | 1.734447405 |
|---|---|
| Coefficient of variation (CV) | 0.4789583313 |
| Kurtosis | -1.406802622 |
| Mean | 3.621290813 |
| Median Absolute Deviation (MAD) | 0.108 |
| Skewness | -0.7091879564 |
| Sum | 149153.726 |
| Variance | 3.0083078 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4.857 | 2868 | 7.0% | |
| 4.962 | 2613 | 6.3% | |
| 4.963 | 2487 | 6.0% | |
| 4.961 | 1902 | 4.6% | |
| 4.856 | 1210 | 2.9% | |
| 4.964 | 1175 | 2.9% | |
| 1.405 | 1169 | 2.8% | |
| 4.965 | 1071 | 2.6% | |
| 4.864 | 1044 | 2.5% | |
| 4.96 | 1013 | 2.5% | |
| Other values (306) | 24636 | 59.8% |
| Value | Count | Frequency (%) | |
| 0.634 | 8 | < 0.1% | |
| 0.635 | 43 | 0.1% | |
| 0.636 | 14 | < 0.1% | |
| 0.637 | 6 | < 0.1% | |
| 0.638 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.045 | 9 | < 0.1% | |
| 5 | 7 | < 0.1% | |
| 4.97 | 172 | 0.4% | |
| 4.968 | 992 | 2.4% | |
| 4.967 | 643 | 1.6% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5167.035911 |
|---|---|
| Minimum | 4963.6 |
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5017.5 |
| Q1 | 5099.1 |
| median | 5191 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 72.25152767 |
|---|---|
| Coefficient of variation (CV) | 0.01398316732 |
| Kurtosis | -0.003760375696 |
| Mean | 5167.035911 |
| Median Absolute Deviation (MAD) | 37.1 |
| Skewness | -1.044262407 |
| Sum | 212819875.1 |
| Variance | 5220.28325 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5228.1 | 16234 | 39.4% | |
| 5099.1 | 8534 | 20.7% | |
| 5191 | 7763 | 18.8% | |
| 5195.8 | 3683 | 8.9% | |
| 5076.2 | 1663 | 4.0% | |
| 5017.5 | 1071 | 2.6% | |
| 4991.6 | 773 | 1.9% | |
| 5008.7 | 650 | 1.6% | |
| 4963.6 | 635 | 1.5% | |
| 5023.5 | 172 | 0.4% |
| Value | Count | Frequency (%) | |
| 4963.6 | 635 | 1.5% | |
| 4991.6 | 773 | 1.9% | |
| 5008.7 | 650 | 1.6% | |
| 5017.5 | 1071 | 2.6% | |
| 5023.5 | 172 | 0.4% |
| Value | Count | Frequency (%) | |
| 5228.1 | 16234 | 39.4% | |
| 5195.8 | 3683 | 8.9% | |
| 5191 | 7763 | 18.8% | |
| 5176.3 | 10 | < 0.1% | |
| 5099.1 | 8534 | 20.7% |
y
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| "no" | |
|---|---|
| "yes" |
| Value | Count | Frequency (%) | |
| "no" | 36548 | 88.7% | |
| "yes" | 4640 | 11.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.112654171 |
| Min length | 4 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Unnamed: 0 | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 56 | "housemaid" | "married" | "basic.4y" | "no" | "no" | "no" | "telephone" | "may" | "mon" | 261 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 1 | 1 | 57 | "services" | "married" | "high.school" | "unknown" | "no" | "no" | "telephone" | "may" | "mon" | 149 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 2 | 2 | 37 | "services" | "married" | "high.school" | "no" | "yes" | "no" | "telephone" | "may" | "mon" | 226 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 3 | 3 | 40 | "admin." | "married" | "basic.6y" | "no" | "no" | "no" | "telephone" | "may" | "mon" | 151 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 4 | 4 | 56 | "services" | "married" | "high.school" | "no" | "no" | "yes" | "telephone" | "may" | "mon" | 307 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 5 | 5 | 45 | "services" | "married" | "basic.9y" | "unknown" | "no" | "no" | "telephone" | "may" | "mon" | 198 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 6 | 6 | 59 | "admin." | "married" | "professional.course" | "no" | "no" | "no" | "telephone" | "may" | "mon" | 139 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 7 | 7 | 41 | "blue-collar" | "married" | "unknown" | "unknown" | "no" | "no" | "telephone" | "may" | "mon" | 217 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 8 | 8 | 24 | "technician" | "single" | "professional.course" | "no" | "yes" | "no" | "telephone" | "may" | "mon" | 380 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
| 9 | 9 | 25 | "services" | "single" | "high.school" | "no" | "yes" | "no" | "telephone" | "may" | "mon" | 50 | 1 | 999 | 0 | "nonexistent" | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | "no" |
Last rows
| Unnamed: 0 | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41178 | 41178 | 62 | "retired" | "married" | "university.degree" | "no" | "no" | "no" | "cellular" | "nov" | "thu" | 483 | 2 | 6 | 3 | "success" | -1.1 | 94.767 | -50.8 | 1.031 | 4963.6 | "yes" |
| 41179 | 41179 | 64 | "retired" | "divorced" | "professional.course" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 151 | 3 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |
| 41180 | 41180 | 36 | "admin." | "married" | "university.degree" | "no" | "no" | "no" | "cellular" | "nov" | "fri" | 254 | 2 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |
| 41181 | 41181 | 37 | "admin." | "married" | "university.degree" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 281 | 1 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "yes" |
| 41182 | 41182 | 29 | "unemployed" | "single" | "basic.4y" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 112 | 1 | 9 | 1 | "success" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |
| 41183 | 41183 | 73 | "retired" | "married" | "professional.course" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 334 | 1 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "yes" |
| 41184 | 41184 | 46 | "blue-collar" | "married" | "professional.course" | "no" | "no" | "no" | "cellular" | "nov" | "fri" | 383 | 1 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |
| 41185 | 41185 | 56 | "retired" | "married" | "university.degree" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 189 | 2 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |
| 41186 | 41186 | 44 | "technician" | "married" | "professional.course" | "no" | "no" | "no" | "cellular" | "nov" | "fri" | 442 | 1 | 999 | 0 | "nonexistent" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "yes" |
| 41187 | 41187 | 74 | "retired" | "married" | "professional.course" | "no" | "yes" | "no" | "cellular" | "nov" | "fri" | 239 | 3 | 999 | 1 | "failure" | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | "no" |